Disentangled Graph Recurrent Network for Document Ranking

نویسندگان

چکیده

Abstract BERT-based ranking models are emerging for its superior natural language understanding ability. All word relations and representations in the concatenation of query document modeled self-attention matrix as latent knowledge. However, some knowledge has none or negative effect on relevance prediction between document. We model observable unobservable confounding factors a causal graph perform do-query to predict label given an intervention over this graph. For observed factors, we block back door path by adaptive masking method through transformer layer refine disentangled refinement layer. unobserved resolve do-operation from front decomposing into related unrelated parts decomposition Pairwise loss is mainly used ad hoc task, triangle distance introduced both layers more discriminative representations, mutual information constraints put Experimental results public benchmark datasets TREC Robust04 WebTrack2009-12 show that DGRe outperforms state-of-the-art baselines than 2% especially short queries.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Recurrent Neural Network for Document Modeling

This paper proposes a novel hierarchical recurrent neural network language model (HRNNLM) for document modeling. After establishing a RNN to capture the coherence between sentences in a document, HRNNLM integrates it as the sentence history information into the word level RNN to predict the word sequence with cross-sentence contextual information. A two-step training approach is designed, in wh...

متن کامل

An Ensemble Click Model for Web Document Ranking

Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...

متن کامل

Sentence Ranking for Document Indexing

This article discusses a new document indexing scheme for information retrieval. For a structured (e.g., scientific) document, Pasi et al. proposed varying weights to different sections according to their importance in the document. This concept is extended here to unstructured documents. Each sentence in a document is initially assigned weights (significance in the document) with the help of a...

متن کامل

Document Modeling with Gated Recurrent Neural Network for Sentiment Classification

Document level sentiment classification remains a challenge: encoding the intrinsic relations between sentences in the semantic meaning of a document. To address this, we introduce a neural network model to learn vector-based document representation in a unified, bottom-up fashion. The model first learns sentence representation with convolutional neural network or long short-term memory. Afterw...

متن کامل

Latent Document Re-Ranking

The problem of re-ranking initial retrieval results exploring the intrinsic structure of documents is widely researched in information retrieval (IR) and has attracted a considerable amount of time and study. However, one of the drawbacks is that those algorithms treat queries and documents separately. Furthermore, most of the approaches are predominantly built upon graph-based methods, which m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Science and Engineering

سال: 2022

ISSN: ['2364-1541', '2364-1185']

DOI: https://doi.org/10.1007/s41019-022-00179-3